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Abstract — The theoretic results of MIMO capacity tell us that 
the higher the number of antennas are employed, the higher the 
transmission rate is. This makes MIMO systems with hundreds 
of antennas very attractive but one of the major problems 
that obstructs such large dimensional MIMO systems from the 
practical realization is a high complexity of the MIMO detector. 
We present in this paper the new soft output MIMO detector 
based on matched filtering that can be applied to the large MIMO 
systems which are coded by the powerful non-binary LDPC 
codes. The per-bit complexity of the proposed detector is just 
0.28% to that of low complexity soft output MMSE detector and 
scales only linearly with a number of antennas. Furthermore, the 
coded performances with small information length 800 bits are 
within 4.2 dB from the associated MIMO capacity. 

I. Introduction 

Over recent years, the multiple input multiple output 
(MIMO) systems which employ tens to hundreds transmit- 
ted/received antennas have been a subject of considerable 
interest (TJ-|[3j. Without additional bandwidth and transmitted 
power requirements, such large MIMO systems can provide 
higher transmission rate simultaneously with better reliability. 
The optimal maximum likelihood (ML) detector is known to 
be infeasible for large MIMO systems since its complexity 
grows exponentially with the number of transmitted antennas 
and the modulation order. Therefore, the research on designing 
the low complexity detector for large MIMO systems is 
garnering much attention j4). 

In this paper, we intend to deal with the coded large MIMO 
systems in which the operating region, i.e., BER of 10 -5 , is in 
low SNRs. We thus consider only the classical linear detectors 
such as matched filtering (MF) or minimum mean square error 
(MMSE) techniques since the detection performance of these 
linear detection schemes at low SNRs seems to be near optimal 
Q. Most importantly, these kinds of detectors have very low 
complexity and can be applied to large MIMO system. 

Although practical coded large MIMO systems have been 
proposed in the literature, the performance gap to the capacity 
remains large |6j, (7). For example, the performance of turbo 
coded large MIMO systems with likelihood ascent search 
(LAS) detection [6| is more than 7.5 away from its associated 
capacity. We have recently shown in our previous study that 
non-binary LDPC coded large MIMO systems with soft output 
MMSE detection and small code length can be employed 
to reduce the remaining gap to just 3.5 dB with affordable 
complexity [8|. However, MMSE detector needs to calculate 



the inverse of large and dense matrix and this calculation is 
known to has the cubic complexity with a number of received 
antennas |9). To avoid such matrix inversion, we propose 
in this paper the soft output MIMO detector based on MF 
technique for large MIMO systems which are coded by non- 
binary LDPC codes. The complexity analysis provided in this 
paper indicates that the detection complexity of the proposed 
detector is very low, i.e., it is just 0.28% to that of the soft 
output MMSE detection while the loss in coded performance, 
compared to the case of employing MMSE detection, is 
marginal especially at very low SNRs. 

The rest of this paper is organized as follows. We first 
describe the system model in Section II. In Section III, 
the proposed soft output MF-based detector is presented. In 
Section IV, we present the performance of the non-binary 
LDPC coded large MIMO systems which employ the proposed 
detector. In Section V, the complexity analysis of the proposed 
detector is provided. Finally, the conclusions are given. 

II. System Model 

In this study, N t x N r MIMO system represents the MIMO 
system with N t transmit antennas and N r receive antennas. 
Let A M be the complex modulation constellation of size M = 
2 P where p represents bit(s) per modulated symbol. Figure [T] 
shows the MIMO system concatenated with non-binary LDPC 
codes of rate R = K/N. In this paper, non-binary LDPC 
codes defined over GF(2 8 ) are chosen because the excellent 
performance can be achieved with simple code structure [8|, 
p0| , i.e., ultra sparse regular parity-check matrix. 
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Fig. 1. The MIMO system concatenated with non-binary LDPC codes. 

At the transmitter side, a bit to symbol mapper maps a 
group of 8 information bits to a non-binary symbol in GF(2 8 ). 
The non-binary LDPC encoder then encodes the stream of 



K symbols into a codeword of length N symbols. Each 
coded symbol in GF(2 8 ) is then mapped to a group of q = 
S/p modulated symbols by a constellation mapper. At each 
time instant (each channel use), the transmitter simultaneously 
sends N t — K t q modulated symbols in parallel through N t 
transmit antennas where K t is a number of coded symbols per 
each transmission. Let s = [si, S2, . . . , sjvJ t € C Nt be the 
transmit signal vector. Each entry Sj, Vi E {1, . . . , N t } taken 
from A M must satisfy the component-wise energy constraint 
E[||si|| 2 ] = E s /N t where E s is the total transmitted power, 
|| • || 2 denotes the squared Euclidean norm, and E[ ] denotes 
the expectation. 

Considering a baseband discrete time model for uncorre- 
lated fiat fading MIMO channel, the received vector y = 
[yi,2/2, ■ • ■ ,yN r ] T € C Nr of the N t x N r MIMO system is 
given by |6j 

y = Hs + n. (1) 

The matrix H = [HiH 2 ...HjvJ € C^*^ denotes the 
channel fading matrix whose entry hkj is assumed to be 
complex Gaussian random variable with zero mean and unit 
variance. The vector n = [n-i, r&2, • • • , n>N r ] T <= < C Nr is a 
noise vector whose entry is a complex white Gaussian noise 
with zero mean and variance cr 2 per real component. The 
MIMO detector performs detection and produces the prior 
probabilities (soft output) for non-binary LDPC decoder. After 
the decoder has all information for N coded symbols, the 
decoder performs decoding and then provides the estimated 
non-binary symbols (hard output). These estimated symbols 
are finally demapped to a sequence of estimated information 
bits. In this paper, the channel matrix H is assumed to be 
known at the receiver. 

Since each entry of H has unit variance, the average signal 
energy per receive antenna is E s . We follow the convention 
that Nq/2 = er 2 to define the signal to noise ratio. In this 
setting, the average signal to noise ratio (SNR) per receive 
antenna, denoted by 7, is given by [TT| 

E s _ E s 
^0 ~ 2of" 



as MMSE detector. For the MMSE detector, it is needed to 



7 = 



(2) 



The spectral efficiency (net information rate per unit band- 
width) of coded MIMO system with spatial multiplexing 
technique is pRN t where p represents a number of bits per 
modulated symbol and R is code rate [11]. With a perfect H 
at the receiver side, the ergodic MIMO capacity is given by 

C = E [tog 2 det (Ijv, + ( 7 /JV t ) HH H )] , (3) 

where the superscript H denotes the Hermitian transpose 
operator, det denotes the determinant and Ijy r is the identity 
matrix of size N r x N r . The unit of spectral efficiency and 
capacity is bits/sec/Hz (bps/Hz). 

III. MIMO Detector based on Matched Filtering 

Before going to describe the proposed detector, let us 
introduce a class of low complexity linear detection known 



compute the weight matrix W which is given below 1 13 1, 

W mmse = (lffl H + ^7^IiV r ) H H . (4) 

The MMSE detector then estimates s, of the transmitted 
symbol on ith antenna by multiplying the received vector y 
with the ith row of W mmse , 



W 



(5) 



where W mmse i is the ith row of W mmse . Finally, §{ is sliced 
to obtain the hard estimation. 

For large MIMO systems, a major problem of MMSE detec- 
tion is the matrix inversion which has cubic time complexity 
|9|. In this paper, we avoid the matrix inversion by considering 
MF-based detection. Although the MF technique for MIMO 
detection is well known in literature but we present in this 
paper a novel soft output generation from MF-based detection 
for non-binary LDPC decoder which has not yet reported 
elsewhere. 

A. Detection based on Matched Filtering 

A baseband discrete time channel model given in ([T| can 
be equivalently rewritten as 



y = HiA-i + H 2 s 2 



■ Hzv t Sjv t + n. 



(6) 



To estimate §k of the transmitted symbol on the kth antenna 
by the concept of matched filtering, the interference streams 
in |6]l are treated as noise, 



(7) 



Then, the MIMO detector based on MF technique estimates Sk 
of the transmitted symbol on the fcth antenna by multiplying 
the received vector y with weight matrix 



H" 



H, Hi, 



s k = W fe y, 



s k + W fc J2 H i fi i + W fe«- 

i— l,i^k 



(8) 



In contrast of the conventional MF detection, we make a 
little modification by introducing the term HH 1 H into weight 
matrix. After slicing s^, we can obtain the estimated symbol 
transmitted from the kth antenna. It is known that the detection 
performance (uncoded performance) of match filtering is very 
poor since there is no cancellation of the interference terms 
(the second term in R.H.S of ([8])). However, matched filtering 
is near-optimum if n is dominant [5|, i.e. at low SNRs. 

B. Generation of Soft Output 

Assuming all streams and noise are statistical independence 
and Gaussian distributed, the proposed method for generating 
soft output from MF-based detection will be described as 
follows. Following the definition of the post detection signal 



to interference plus noise ratio (SINR) given in p4) p. 358], 
the SINR of the proposed detection for kth stream, denoted 
by 5k, can be expressed as 



i = l,i^k 



E[||s fc || 2 ] 
|W fc H, Si || 2 ] - 
Es/N t 



E[||W fc n|| 



(9) 



E./Nt 
E s /Nt 
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+ (2o-2||W fc 



The denominator, denoted by Afe, given in (|9]l can be ap- 
proximated by a Gaussian random variable. Based on this 
approximation, we set a\ = Afc and the soft output which 



exactly is the likelihood of Sfc conditioned on s £ 
follows 

p (h I s) = J-^j exp y—^sWh ~ s\ 



(10) 



To justify that Afc distributes like Gaussian, we estimate 
the probability density function (pdf) of random variable Afc 
by mean of Kernel density estimation [15|. By using 10,000 
channel realizations, it is obviously seen from Fig. [2] that 
the pdf of Afc closely agrees with the shape of Gaussian 
distribution. The mean just deviates from zero by 0.0209. 
Therefore, we conclude that the Gaussian approximation of 
Afc described above is reasonable. 
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Fig. 2. The pdf estimation of A k from 200 X 200 MIMO channel with 
BPSK modulation at SNR per receive antenna 7 = -2 dB. 



C. Simplification 

To reduce the computation complexity of detection, we 
observe that E [H^Hfc] ss N r . Thus, the computation of Wfc 
can be simplified as follows 

Wfc =Hg(l/JV r ). (11) 

With this simplification, the matrix multiplication H^Hfc 
whose an complexity is 0{N r ) [16] can be reduced to just 
a constant term. 



To reduce the computational complexity of soft output 
generation, we first observe that E [||Wfc|| 2 ] ss 1/N r . Then, 
we assume that the power of noise term is dominant, i.e., 
2c 2 ^> E s , With this assumption, we have the following 
expression 



N t 



EslN t ^ 



|WfcH, i j| 2 «2 ( r I 2 JWfc|| 2 . 



The above expression is valid when 7 is low, e.g., 7 = -5 

N t 

dB, since 1/N t £ ||WfcHJ 2 < ||Wfc|| 2 . Therefore, the 

i— l,i^k 

computation of variance cr 2 for soft output generation defined 



is as in ( fTT)| ) can be reduced to a constant term 



°l =E S /N t £ ||WfcH 4 

«2a 2 ||Wfc|| 2 , 
« 2a 2 jN r . 



(2a 2 ||Wfc|| 2 ), 



(12) 

The variance cr 2 for soft output generation is now independent 
of k and can be easily pre-computed. 

IV. Numerical Results 

In this section, the numerical results of the uncoded and 
the coded large MIMO systems are presented. The non-binary 
LDPC codes defined over GF(2 8 ) with regular parity-check 
matrix of column weight two are deployed as channel code. 
The decoder is implemented by FFT-based belief propagation 
algorithm |17| with maximum iteration 200. The BPSK mod- 
ulation is employed for transmission. We shall refer to non- 
binary LDPC coded large MIMO system with the proposed 
MF-based detector as the proposed system for simplicity. 

Figure [3] displays the uncoded performance of various 
low complexity linear detection schemes at low SNRs. The 
uncoded peformance of single input single output (SISO) 
unfaded AWGN-BPSK system is also shown in the figure to 
represent the approximated lower bound for ML performance 
[ 3 1 . Major observations can be summarized as follows : 

• There is no performance difference between MF detection 
and its corresponding simplified version. 

• The performance of MF detection is seriously degraded 
comparing to that of MMSE detection when 7 gets higher. 

• The performance of MMSE and MF detections at low 
SNRs achieves near the performance of SISO unfaded AWGN. 
Thus, both are near optimal in low SNR region. 

• The zero forcing (ZF) detection performs very poor in low 
SNRs so we do not pay the attention to this type of detector. 

It is thus intuitive to think that the proposed MF-based 
detector would be useful when operating in low SNR region, 
i.e. at near MIMO capacity. 

Figure [4] shows the performance comparison between the 
proposed system and the non-binary LDPC coded system with 
soft output MMSE detector when the number of transmitted 
antennas is N t = 200. The generation of soft outputs from 
MMSE detector are given in JTSJ. The code rates of non- 
binary LDPC codes are ranging from 1/12 to 1/3 and the 
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Fig. 3. Uncoded performance of MMSE, ZF, MF, and simplified MF detectors 
in 200 X 200 MIMO system with BPSK modulation. 



information length is 800 bits (K = 100 symbols). We select 
the low rate channel codes since we expect that the operating 
region, e.g. BER of 10~ 4 , will be occurred in low SNRs at 
which the proposed detection is near optimal. For R = 1/3, 
the performance of coded system with MMSE detector is 
better than that of the proposed system by about 0.8 dB. This 
is because the uncoded performance of MF-based detector 
is worse than that of MMSE detector by roughly 2 dB as 
shown in the previous figure. The performance difference 
between two coded systems (one with MMSE and another 
one with MF) is vanished for R = 1/6,1/9 and 1/12 
since, in low SNRs, both detectors provides almost the same 
detecting performance. Note that the codes with R < 1/3 are 
constructed by the instruction given in fl9) . 
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Fig. 4. BER of the non-binary LDPC coded 200 X 200 MIMO systems 
with BPSK modulation. "Propose" means the non-binary LDPC coded system 
with the proposed soft output MF-based detection and "MMSE" represents 
the non-binary LDPC coded system with soft-output MMSE detection. 



Figure [3] illustrates the performance of the proposed system 
in terms of spectral efficiency. We plot the SNR points of 
the proposed system required to reach the BER of 10 -4 . For 
spectral efficiency below 66.67 bps/Hz, the proposed system 
can operate within 4.2 dB from the associated MIMO capacity. 
Thanks to the large N t , even we code the system with low 
rate non-binary LDPC codes but the high data rate still can 
be obtained. For example, 1 Gbps over bandwidth of 50 MHz 
can be obtained from R =l/9 coded 200 x 200 MIMO-BPSK 
systems. More interestingly, the reliable communications with 
very high spectral efficiency, about 16 bps/Hz, and extremely 
low SNR, roughly -10 dB, are unprecedented in literature. 




y : average SNR per receive antenna (dB) 

Fig. 5. Spectral efficiency of the proposed coded 200 X 200 MIMO systems. 
The number in parentheses represents a gap to the capacity. 



V. Complexity Analysis 

We will show in this section through complexity analysis 
that the performance loss from employing the proposed detec- 
tor is marginal when one looks at the complexity reduction. 
Following [9 1, [16, p. 18], the complexity of complex matrix 
calculation is expressed in terms of complex floating point 
operations (flops). The number of flops for some mathematical 
operations is given in Table [I] We note that the detection for 
coded MIMO systems comprises of two steps : 1) detection 
and 2) soft output generation. 

TABLE I 

The number of flops for some operations (9) where a, b e R, 
a,b 6 C, AND c, d e C Nrxl ~^ 



Operation (name) 


Operation (math.) 


Number of flops 


Real multiplication 


ab 


1 


Complex multiplication 


ab 


3 


Real addition 


a + b 


1 


Complex addition 


a + b 


1 


Inner product 


c H d 


4iV r - 1 


Scalar-vector multiplication 


a ■ a 


N r 



A. Proposed Soft Output Detector 

The estimation of s k as defined in §8§ needs two steps : 
1) Computing the weight matrix Wfc according to ( fTT) which 
requires N r flops. 2) Multiplying Wfc with received vector 
y which requires 4N r — 1 flops. Since we have N r received 
antennas, the total flops used for detection are 5N!j! — N r . 
Thus, the per-bit complexity is 0(N r ). To obtain the soft 



output defined in ( 10 1, we need one flop for computing squared 
euclidean norm, one flop for subtraction, 3 flops for multipli- 
cation of real constant, and assuming 50 flops for exponential 
calculation pO} p.193]. Each transmitted symbol has M levels 
to be calculated and we have N t — N r transmitted symbols. 
Therefore, in total, 55MN r flops are required for generating 
soft outputs. In summary, the overall per-bit complexity for 
both detection and generating soft output is 0(N r ). 

B. Soft Output MMSE Detector 

In the case of MMSE detector, it has been shown that the 
complexity for detection is 107V; 3 + 5.57V r 2 + 1.5JV r flops ||9). 
For generating of soft outputs, the likelihood of s k conditioned 
on s e A A/ is given by [18|, 



P (h | 8) 



1 



exp 



1 



\Sk - [J<kS\\ 



(13) 



where fi k = 

,2 _ 



mmse k'-'-k which requires 4N r — 1 flops and 



H 

(/ife — /i 2 ) which requires 3 flops. Again, each 
transmitted symbol have M level to be calculated and we have 
Nt = N r transmitted symbols. To calculate soft output defined 
in ( 13 I, AMN^ + 58 M flops are required for generating soft 
outputs from all estimated symbols. 

Table [II] summarises the computational complexity of both 
the proposed detector and the MMSE detector in terms of 
flops. For each transmission in 200 x 200 MIMO system 
with BPSK modulation, 80, 540, 416 flops are required for the 
MMSE detector while we need only 221, 800 flops for the pro- 
posed MF detector (just 0.28 % to the MMSE detection) Thus, 
by using the proposed detector, the computational complexity 
of large MIMO detection is strongly reduced. 

TABLE II 

The number of flops for MIMO detection. 



Type \ Operation 


Detection 


Soft Output 


Proposed detector 


5N? - N r 


55MN r 


MMSE detector 


1W?+5.5N? + l.5N r 


iMN'r + 58M 



VI. Conclusions 

In this paper, we develop the soft output detector based 
on MF technique for non-binary LDPC coded large MIMO 
systems. An advantage of the proposed system is very low 
complexity of detection and soft output generation, i.e., overall 
complexity is just 0.28% when comparing with that of the soft 
output MMSE detector. By using 200 transmit/receive anten- 
nas and BPSK modulation, the performance within 4.2 dB 
from the capacity limit can be obtained at spectral efficiency 



below 66.67 bps/Hz. Due to the low complexity detection, 
one can expect the improved performance by considering 
the joint iterative detection and decoding in turbo fashion. 
The performance of the proposed coded system with higher 
modulation is also not shown in this paper. Both will be further 
explored as the future work. 
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